TextControlGAN: Text-to-Image Synthesis with Controllable Generative Adversarial Networks
نویسندگان
چکیده
Generative adversarial networks (GANs) have demonstrated remarkable potential in the realm of text-to-image synthesis. Nevertheless, conventional GANs employing conditional latent space interpolation and manifold (GAN-CLS-INT) encounter challenges generating images that accurately reflect given text descriptions. To overcome these limitations, we introduce TextControlGAN, a controllable GAN-based model specifically designed for synthesis tasks. In contrast to traditional GANs, TextControlGAN incorporates neural network structure, known as regressor, effectively learn features from texts. further enhance learning performance data augmentation techniques are employed. As result, generator within can texts more effectively, leading production closely adhere textual conditions. Furthermore, by concentrating discriminator’s training efforts on GAN exclusively, overall quality generated is significantly improved. Evaluations conducted Caltech-UCSD Birds-200 (CUB) dataset demonstrate surpasses cGAN-based GAN-INT-CLS model, achieving 17.6% improvement Inception Score (IS) 36.6% reduction Fréchet Distance (FID). supplementary experiments utilizing 128 × resolution images, exhibits ability manipulate minor bird according These findings highlight powerful tool high-quality, text-conditioned paving way future advancements field
منابع مشابه
Improvement of generative adversarial networks for automatic text-to-image generation
This research is related to the use of deep learning tools and image processing technology in the automatic generation of images from text. Previous researches have used one sentence to produce images. In this research, a memory-based hierarchical model is presented that uses three different descriptions that are presented in the form of sentences to produce and improve the image. The proposed ...
متن کاملGenerative Adversarial Text to Image Synthesis
Automatic synthesis of realistic images from text would be interesting and useful, but current AI systems are still far from this goal. However, in recent years generic and powerful recurrent neural network architectures have been developed to learn discriminative text feature representations. Meanwhile, deep convolutional generative adversarial networks (GANs) have begun to generate highly com...
متن کاملMedical Image Synthesis with Context-Aware Generative Adversarial Networks
Computed tomography (CT) is critical for various clinical applications, e.g., radiotherapy treatment planning and also PET attenuation correction. However, CT exposes radiation during acquisition, which may cause side effects to patients. Compared to CT, magnetic resonance imaging (MRI) is much safer and does not involve any radiations. Therefore, recently, researchers are greatly motivated to ...
متن کاملStackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks
Although Generative Adversarial Networks (GANs) have shown remarkable success in various tasks, they still face challenges in generating high quality images. In this paper, we propose Stacked Generative Adversarial Networks (StackGAN) aimed at generating high-resolution photorealistic images. First, we propose a two-stage generative adversarial network architecture, StackGAN-v1, for textto-imag...
متن کاملControllable Generative Adversarial Network
Although it is recently introduced, in last few years, generative adversarial network (GAN) has been shown many promising results to generate realistic samples. However, it is hardly able to control generated samples since input variables for a generator are from a random distribution. Some attempts have been made to control generated samples from GAN, but they have shown moderate results. Furt...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2023
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app13085098